Model Selection

Multi-dialect Support

# Multi-dialect Support

Xiyansql QwenCoder 3B 2504

XiYanSQL-QwenCoder-3B-2504 is the latest SQL generation model released by XGenerationLab. Optimized through fine-tuning and GRPO training, it supports multiple dialects and delivers efficient and accurate SQL generation capabilities.

Large Language Model

Safetensors Supports Multiple Languages

Xiyansql QwenCoder 7B 2504

A fine-tuned SQL generation model based on QwenCoder, supporting multiple dialects with excellent performance

Text Generation

Safetensors Supports Multiple Languages

Mms 300m Arabic Dialect Identifier

This model is fine-tuned from MMS-300m for Arabic dialect speech recognition, capable of identifying Modern Standard Arabic and four major Arabic dialects.

Audio Classification

Transformers Arabic

Whisper Small Tel

A speech recognition model fine-tuned on Telugu audio datasets based on OpenAI Whisper-large-v2

Speech Recognition

Transformers Other

A high-quality Arabic speech synthesis model fine-tuned based on F5-TTS, supporting diverse pronunciations and accents from different regions

Speech Synthesis Supports Multiple Languages

Audiox South V1

AudioX is a multilingual automatic speech recognition model developed by Jivi AI, specifically optimized for South Indian languages, supporting Tamil, Telugu, Kannada, and Malayalam.

Speech Recognition Other

A 7-billion-parameter model fine-tuned on CodeLlama, specifically designed for natural language to SQL tasks, supporting multiple SQL dialects and 16k context length processing

Large Language Model

Transformers Supports Multiple Languages

Indic Whisper Hi Multi Gpu

IndicWhisper is a cutting-edge speech recognition model optimized for Indian languages, excelling in various benchmarks for Indian languages.

Speech Recognition Other

Whisper Base Arabic

An Arabic speech recognition model based on Whisper-base, fine-tuned on multiple Arabic datasets, specializing in Arabic speech-to-text tasks

Speech Recognition

Transformers Supports Multiple Languages

Arat5 Arabic Dialects Translation

This model is trained on Arabic dialect datasets for translating Arabic dialects into Modern Standard Arabic (MSA).

Machine Translation

Transformers Arabic

Speecht5 Finetuned Fleurs Zh

A Chinese text-to-speech model fine-tuned on the fleurs dataset based on microsoft/speecht5_tts

Speech Synthesis

Whisper Small Cv11 French

A French automatic speech recognition model fine-tuned based on openai/whisper-small, trained on the Common Voice 11.0 French dataset, supporting case sensitivity and punctuation prediction.

Speech Recognition

Transformers French

Whisper Telugu Base

A Telugu automatic speech recognition (ASR) model fine-tuned based on OpenAI Whisper-base, trained on multiple public Telugu datasets

Speech Recognition Other

Whisper Medium Ar

A speech recognition model fine-tuned on Arabic datasets based on openai/whisper-medium

Speech Recognition

Whisper Large Sme

A Northern Sami speech recognition model fine-tuned on Whisper-large-v2, achieving a word error rate of 24.91% on the test set

Speech Recognition

Transformers Other

Opus Mt Tc Big Ar En

This is a neural machine translation model for Arabic to English translation, part of the OPUS-MT project.

Machine Translation

Transformers Supports Multiple Languages

Wav2vec2 Large Hindicone

This model is a speech recognition model fine-tuned on the Common Voice dataset based on facebook/wav2vec2-xls-r-300m, supporting Hindi.

Speech Recognition

Wav2vec2 Xlsr Romansh Sursilvan

This model is an automatic speech recognition model fine-tuned on the Romansh-Sursilvan dialect dataset based on facebook/wav2vec2-xls-r-1b, achieving a word error rate (WER) of 13.82% on the Common Voice 8 test set.

Speech Recognition

Wav2vec2 Large Xls R 300m Ha Cv8

A Hausa speech recognition model fine-tuned on the Common Voice dataset based on facebook/wav2vec2-xls-r-300m

Speech Recognition

Transformers Other

Beto Sentiment Analysis

A sentiment analysis model trained based on the BETO Spanish BERT model, supporting POS/NEG/NEU three-class sentiment classification

Text Classification Spanish

Bert Base Arabertv02 Twitter

A BERT model optimized for Arabic dialects and tweets, pre-trained on 60 million Arabic tweets with MLM tasks, with added support for emojis and common vocabulary.

Large Language Model

Transformers Arabic

Bert Large Arabertv02 Twitter

AraBERTv0.2-Twitter is a pre-trained language model optimized for Arabic dialects and tweets, developed based on the BERT architecture, with added support for emojis and common vocabulary.

Large Language Model

Transformers Arabic

Albert Large Arabic

Arabic pretrained version of ALBERT large model, trained on approximately 4.4 billion words of Arabic corpus

Large Language Model

Transformers Arabic

Wav2vec2 Xls R 300m Zh HK Lm V2

An automatic speech recognition model based on XLS-R architecture, optimized for Cantonese (zh-HK), fine-tuned on the Common Voice dataset and enhanced with a 5-gram language model.

Speech Recognition

Bp Cetuc100 Xlsr

Wav2vec2 model fine-tuned for Brazilian Portuguese using the CETUC dataset, trained with approximately 145 hours of Brazilian Portuguese speech data

Speech Recognition

Transformers Other

Wav2vec2 Large Xls R 300m Hindi

This is an automatic speech recognition (ASR) model fine-tuned on Hindi speech datasets based on Facebook's wav2vec2-xls-r-300m model

Speech Recognition

Transformers Other

Albert Xlarge Arabic

An Arabic version of the ALBERT Xlarge pretrained language model, trained on approximately 4.4 billion words, supporting Modern Standard Arabic and some dialectal content.

Large Language Model

Transformers Arabic

An automatic speech recognition model fine-tuned on Marathi datasets based on facebook/wav2vec2-xls-r-300m

Speech Recognition

Transformers Other

StephennFernandes

Wav2vec2 Large Xlsr 53 Chinese Zh Cn Gpt

A Chinese (zh-CN) speech recognition model fine-tuned on the Common Voice dataset based on facebook/wav2vec2-large-xlsr-53

Speech Recognition

Transformers Chinese

Wav2vec2 Large Xls R 300m Galician

This is an automatic speech recognition model fine-tuned on Galician speech datasets based on facebook/wav2vec2-xls-r-300m.

Speech Recognition

Transformers Other

Bert Medium Arabic

Pre-trained Arabic BERT medium language model, trained on approximately 8.2 billion words of Arabic text resources

Large Language Model Arabic

Bert Large Arabertv2

AraBERT is a pre-trained language model based on Google's BERT architecture, specifically designed for Arabic natural language understanding tasks.

Large Language Model Arabic

Xls R 2b Nl V2 Lm 5gram Os2 Hunspell

A CTC model based on XLS-R with a 5-gram language model from Open Subtitles, primarily used for automatic speech recognition in Dutch and Flemish.

Speech Recognition

Transformers Other

Ara DialectBERT

A BERT model for Arabic dialects, further trained on the HARD-Arabic-Dataset based on CAMeL-Lab's bert-base-camelbert-msa-eighth model

Large Language Model Arabic

Bert Base Arabic Camelbert Mix Ner

An Arabic named entity recognition model fine-tuned based on CAMeLBERT Mix, supporting entity recognition in Modern Standard Arabic, dialects, and Classical Arabic

Sequence Labeling

Transformers Arabic

Bert Base Arabic Camelbert Msa Did Nadi

A dialect identification model fine-tuned based on the CAMeLBERT Modern Standard Arabic model, supporting 21 Arabic dialect identifications.

Text Classification

Transformers Arabic

XLSR 300M Nynorsk

A Nynorsk automatic speech recognition model based on the XLSR-300M architecture, trained on the NPSC dataset with low word error rate and character error rate.

Speech Recognition

A Transformer-based machine translation model for Swedish to Chinese, supporting multiple Chinese variants, developed by the Helsinki-NLP team

Machine Translation

Transformers Supports Multiple Languages

This is a machine translation model from Chinese to German based on the transformer-align architecture, supporting translation from various Chinese dialect variants such as Mandarin and Cantonese to German.

Machine Translation

Transformers Supports Multiple Languages

This is a Norwegian T5-based model trained on the Norwegian Colossal Corpus (NCC) using TPU v3-8.

Large Language Model Other

Featured Recommended AI Models

AIbase

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご

© 2025AIbase